A matching algorithm between arbitrary sections of two speech data sets for speech retrieval
نویسنده
چکیده
This paper proposes a new matching algorithm to retrieve speech information from a speech database by speech query that allows continuous input. The algorithm is called Shift Continuous DP (CDP). Shift CDP extracts similar sections between two speech data sets. Two speech data sets are considered as reference patterns that are regarded as a speech database and input speech respectively. Shift CDP applies CDP to a constant length of unit reference patterns and provides a fast match between arbitrary sections in the reference pattern and the input speech. The algorithm allows endless input and real-time responses for the input speech query. Experiments were conducted for conversational speech and the results showed Shift CDP was successful in detecting similar sections between arbitrary sections of the reference speech and arbitrary sections of the input speech. This method can be applied to all kinds of time sequence data such as moving images.
منابع مشابه
An Algorithm for Extracting Similar Partial Utterances toward Flexible Spoken Document Retrieval
This paper proposes a new approach for spoken document retrieval by extracting similar partial utterances for non-segmented and non-recognized data; presentation speech, lecture speech or recorded video, and so on. For this purpose, we propose a new, efficient algorithm that performs fast matching between arbitrary sections of the database and arbitrary sections of query input. It enables searc...
متن کاملAn efficient partial matching algorithm toward speech retrieval by speech
This paper proposes a new efficient partial matching algorithm, called Island Driven Partial Matching (IDPM) based on Continuous Dynamic Programming (CDP), to realize flexible retrieval from a speech database by query speech. IDPM enables detecting the sections in the speech database which match partial sections of the query speech efficiently. IDPM applies CDP to short and constant length of u...
متن کاملAutomatic labeling and digesting for lecture speech utilizing repeated speech by shift CDP
This paper proposes an automatic labeling and digesting method for lecture speech. The method utilizes same sections, such as same words or same phrases that are thought to be important and are repeated in the speech. To extract the same sections, we have proposed a new efficient algorithm, called Shift Continuous DP, because it is an extension of Continuous DP and realizes fast matching betwee...
متن کاملSimilar Section Extraction for the Analysis of the Stream Data Structure
The present paper proposes a new algorithm for discovering similar sections between two time sequence data sets. The algorithm, called Partial Matching Discovery, or PMD, is based on Dynamic Programming. PMD realizes fast matching between arbitrary sections in reference stream data and input stream data and enables the extraction of similar sections in a synchronous manner with the input data. ...
متن کاملSpeech Enhancement using Adaptive Data-Based Dictionary Learning
In this paper, a speech enhancement method based on sparse representation of data frames has been presented. Speech enhancement is one of the most applicable areas in different signal processing fields. The objective of a speech enhancement system is improvement of either intelligibility or quality of the speech signals. This process is carried out using the speech signal processing techniques ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2001